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Abstract 

When designing a distributed control system, the system designer has a choice in how to connect the different 
units through communication channels. In practice, noiseless and noisy channels may coexist. Using the standard 
toy example of scalar stabilization, this paper shows how a small amount of noiseless feedback can perform a 
"supervisory" role and thereby boost the effectiveness of noisy feedback. 
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I. Introduction 

Distributed control has long been understood as a challenging problem, and in such situations the control 
signals often have a dual purpose — to achieve the control objectives as well as to communicate necessary 
information between the distributed systems [1], [2], [3]. Consequently, it had been recognized that there may lie 
deep connections between information theory and control theory [4], but establishing such connections remained 
out of reach. Recently however, some deep connections have begun to emerge, most notably in the context of 
stabilization over a limited communication medium. The reader is directed to the recent September 2004 issue of 
IEEE Transactions on Automatic Control and the articles (and references) therein for a more comprehensive survey, 
but it is worthwhile to make note of some relevant prior work here. 

The first truly information-theoretic connection was given by Tatikonda with his work on sequential rate-distortion 
theory. This married the causality constraint critical in control problems to the rate-distortion framework provided by 
Shannon. By using Massey's directed mutual information and relaxing the optimization to be over all channels with 
the same directed mutual information instead of the original channel, it provides a lower-bound on the achievable 
closed-loop performance of a control system over a channel, noisy or otherwise. Because this bound is sometimes 
infinite, it also implies that there is a fundamental rate of information production, namely the sum of the logs 
of the unstable eigenvalues of the plant, that is attached to an unstable linear discrete-time process [5], [6]. The 
corresponding achievability result was based on noiseless, but rate-limited, channels. Nair et al. extended the 
noiseless rate-limited story to the case of unbounded disturbances and observation noise under suitable conditions 
[7], [8]. 

The noisy channel side of the story is distinct and involves additional subtleties. Much work was tied to specific 
communication channels. We had previously showed that it is possible to stabilize persistently disturbed controlled 
Gauss-Markov processes over suitable power-constrained AWGN (Additive White Gaussian Noise) channels [9], 
[10] where it turns out that Shannon capacity is tight and linear observers and controllers are sufficient to achieve 
stabilization [11]. In contrast, we showed that the Shannon capacity of the binary erasure channel (BEC) is not 
sufficient to check stabilizability and introduced the anytime capacity as a candidate figure of merit in [12]. Following 
up on our treatment of the BEC case, Martins et al. have studied more general erasure-type models and have 
also incorporated bounded model uncertainty in the plant [13]. Elia used ideas from robust control to deal with 
communication uncertainty in a mixed continuous/discrete context, but restricting to linear operations [14], [15]. 

Our work in [16], [17] established another deep connection in the noisy channel case by showing how stabilization 
problems over noisy channels are equivalent to anytime communication problems where the encoder has access to 
feedback. These results are extended in [18] to the vector case. In stabilization problems, the sense of stability desired 
(mean-squared, fourth-moment, etc.) determines the target anytime reliability in conjunction with the unstable 
eigenvalues. In contrast, the rate requirement comes only from the unstable eigenvalues just as it does in the 
noiseless rate-limited channel case. 

The feedback anytime capacity is lower bounded by the anytime capacity without feedback in [19], [17]. The 
basic result is that the block random-coding error exponent E r {R) is a lower bound to the anytime reliability 
at rate R. Furthermore, implementable observers and controllers can be designed that approach this lower bound 
by using a nearly memoryless quantization strategy for the observer and a stack-algorithm based strategy for the 
controller [20], [21]. However, using purely information-theoretic arguments [22] shows that the anytime reliability 
with feedback can be much higher and provides a new upper bound called the "focusing bound." For symmetric 



channels, the focusing bound says that at rate R = ' p ' , the anytime reliability can not exceed Eq(p) where Eq(p) 
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Note that for symmetric channels, it suffices to use a uniform q while optimizing (Q [23]. 

[22] also shows that the focusing bound is asymptotically achievable for any erasure type channel or noisy 
channel that has a strictly positive zero-error capacity. (For non-erasure channels whose zero-error capacity is zero, 



2 



W t - 




Designed 
Observer 

O 



U t - 




1 Step 
Delay 



Control Signals 



Designed 
Controller 



Fig. 1. Control over a fortified communication channel. 



the achievability of the focusing bound is still open, although it is known that we can beat the lower-bound of 
E r (R) in essentially all cases [24], [25], [26].) The achievability strategy of [22] is interesting because it has 
strictly bounded computation at both the encoder and decoder and furthermore, it relies on sending very low-rate 
"supervisory" messages using the "noiseless" aspects of the communication channel. 

While it is possible to directly apply the results of [22] to the stabilization problem by means of the equivalence 
established in [16] and a separation architecture, it is more illuminating to give the strategy directly in the 
stabilization context without reference to anytime communication per-se. After setting up some notation and 
preliminaries in Section |H] we give the stabilization scheme in Section before concluding in Section [W] 
Proof ideas are given here, but the "heavy lifting" is found in [16], [22]. 

II. Preliminaries 
The scalar stabilization problem is illustrated in Figure ^ 

X t+1 = XX t + U t + W t , i>0 (2) 

where {X t } is a iR-valued state process. {Ut} is a JR-valued control process and {Wt} is a bounded noise/disturbance 
process s.t. \Wt\ < §. This bound is assumed to hold with certainty. For convenience, we also assume a known initial 
condition Xq = 0. To make things interesting, consider A > 1 so the open-loop system is exponentially unstable. 
The observer/encoder system O observes Xt and generates inputs at to the channel. The decoder/controller system 
C observes channel outputs B t and generates control signals Ut- Both 0,C are allowed to have unbounded memory 
and to be nonlinear in general. The goal is stability: 

Definition 2.1: A closed-loop dynamic system with state X t is ^-stable if there exists a constant K s.t. i?[|.Xi < 
K for all t > 0. 

Definition 2.2: A discrete time discrete memoryless channel (DMC) is a probabilistic system with an input. At 
every time step t, it takes an input x t € X and produces an output y t € y with probability p(y t \x t ). Both X,y 
are finite sets. The current channel output is independent of all past random variables in the system conditioned on 
the current channel input. 

The twist in this paper is that the noisy channel does not have be used alone. In addition, we have a low-rate 
noiseless channel to help out. 

Definition 2.3: Given a DMC P for the forward link, a ^-fortified channel built around it is one in which every 
k-th use of P is supplemented with the ability to transmit a single noise-free bit to the receiver. 
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Fig. 2. Fortification illustrated: the forward noisy channel uses are supplemented with regular low-rate use of a noiseless side channel. 

III. Main Result 

The main result of this paper is that the focusing bound induced fundamental limits are asymptotically achievable 
with even a tiny amount of fortification. 

Theorem 3.1: For a symmetric noisy memoryless channel and any amount of fortification < k < 00, it is 
possible to control the unstable scalar plant of (|2j over that fortified channel so that the //-moment of \X t \ stays 
finite for all time if there exists any e > so that E °!^^ > In A even if the observer is only allowed to observe 
the state X t corrupted by bounded noise. 

Proof: The statement of the theorem does not refer to the focusing bound, but the connection can be seen by 
defining p' = 77 + e. Then ^ > In A is just a statement about the implicit rate R' = Ea ^ ^ being high enough. 
By multiplying both sides by p', it is saying that the reliability Eo(p') > p'lnA > 77mA which is what is necessary 
for 77-stabilization. Thus, this theorem is essentially saying that the focusing bound can be asymptotically achieved 
with fortification. 

The control strategy itself is based on three ideas: 

• Feedback of channel outputs to the observer by making the plant "dance" in an observable way following 
[16]. This makes the proof simpler to understand, but is later eliminated by better leveraging the next idea: 

• Nonuniform sampling of the plant state at the observer for purposes of "codeword" generation for the noisy 
channel uses. The codewords themselves are generated using random coding. The plant state is resampled after 
the controller applies a "true" (as opposed to "dancing") control. 

• Supervisory use of the noiseless fortification channel by the observer to tell the controller how and when to 
apply a "true" control. 

They are covered in greater detail in the next few subsections. 

A. Communication through "dancing" 

The approach is illustrated in Figure [3] and detailed in [16], [27]. The goal is to communicate the noisy channel 
outputs back to the observer so it can guide the controller. The main control signal Ut is zero except when directed 
by the observer as depicted in Section IIII-CI The actual applied control is 

U' t = U t + F(b t )-\F{b t ^) (3) 

where F(bt) = 3T u bt and F u = f2 + (A + 1)T where T is the bound on the noise in state observation. If X Q (t) is 
the boundedly noisy observation of the state X t , the observer can compute X a (t) — XX a (t — 1) which is going to 
be Ul_ 1 + Wt-\ corrupted by a bounded noise of at most F from X a (t) and AT from XX a (t — 1). The total noise 
in observing U[_ x is thus F u and so with this choice of F, the channel output bt can be unambiguously determined 
by the observer. 

If we assume that there is a finite channel output alphabet, then because of the cancellation of the past F(bt-i), 
this dance has no lasting impact on the state and thus can be ignored for stability considerations. 

B. Nonuniform sampling and control choice 

As in [12], the controller must keep track of a total window of uncertainty in which the state lies. The observer 
also tracks this window so as to communicate to the controller. Call the current uncertainty window A t . At a 
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Fig. 3. How to communicate the channel outputs through the plant with state observations only. The controller restricts its main control signal 
to be either zero, or calculated with a delay of 1 time unit. It is adjusted by — XF(b±-i) to eliminate the effect of the past communication. 
The final control signal applied is shifted slightly to encode which bt was received. The observer uses the past bj^ 1 to align its decoding 
regions and then reads off b t by using X t +i — XXt. 



sampling time, the observer divides this window into exp(ni?) (large) different regions and sees where the state 
observation is. Each region is assigned an infinitely long iid codeword drawn uniformly from the channel input 
alphabet according to the optimizing q. Letters from this codeword are transmitted on the noisy channel until the 
state is resampled. 

If the controller faced an uncertain window of At at time t, then without any information, the window will grow 
according to: 

A t+1 = XA t + n (4) 

If the controller learns correctly which region the observation lies in, then the retrospective uncertainty shrinks to 
at most T + At exp(— nR). If this occurs T time steps in the future, the resulting uncertainty is at most 

A t+ T = X T (exp(-nR)A t + T + (5) 

and knowledge of the window allows the controller to apply a "true" control that will center this uncertainty window 
around 0. 

As in [12], it is how the randomness in A T compares to exp(ni?) that will determine whether or not the system 
is ^-stable in closed loop. 



C. Supervision through the noiseless channel outputs 

Due to the "dancing" of Section ITlI-AI the observer knows what channel outputs have been received so far. Thus, 
it can monitor whether the channel outputs are good enough for the controller to successfully decode which region 
the state was in at the last state sampling time. When that occurs, it sends a 1 over the noiseless channel and 
otherwise it sends a zero. Thus, the random variable T is the time till successful decoding of an infinitely long 
codeword chosen out of exp(ni?) possible codewords. So far, nR was just an arbitrary constant. To interpret it, 
think of In A < R < g °^+ e) . Choose p so that R = ^sjd. Think of n as a constant that is a large multiple of k. 

At this point, the detailed analysis in [22] explains what happens. The key points are summarized here. 

a. The T represent the inter-sampling times of the state. This process is iid across time since the codewords are 
randomly chosen and the noisy channel is memoryless. 

b. If p < 1 works, then the scheme described above works without modification and the random variable T is 
strictly dominated by n + T where the T is a geometric random variable with probability of failure given by 
exp(-E (p)). 
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c. If p > 1 is required, then the scheme above must be modified to use list decoding among the top I = \p] entries 
with the next [~log 2 t] noiseless channel uses used to disambiguate the list. This is a small overhead that is 
negligible when nR is large. With that modification, everything from (b) still holds in that T < n+ [log 2 t] k+T. 

At this point, we give the key ideas in the proof: 

Using (b), we can rewrite © to get 

A t+T = \ T (e W (-nR)A t + T + ^j) 

< X n+f (eM~nR)A t + T + ^) 
= X T (X n exp(-nR)A t + TX n + 



< A T (exp [-n(R - In A)] A t + TX n + 



A-l ' 

nx 1 - 



X- 1 



Call this upper bound A t +T so: 

At+T 



A T - n (exp [-n{R - In A)] A t + TX n + -— -) (6) 

A — f 



where we assume that the next successful control application after time t occurs at time t + T. 

It is worthwhile interpreting the random variable T as the inter-arrival time for a packet-erasure channel and 
exp [n(R — In A)] as the number of possible messages carried by a single packet. Since R > In A, the size of the 
packet can be made as large as we want by picking a large 1 n. Viewed this way, the uncertainty window A is very 
likely to shrink from one arrival to the next if it starts out large. 

Asymptotically, the problem behaves like the very large packet erasure-channel case with a packet-erasure 
probability of exp (—Eq (p)). For n large enough, the closed-loop system will have an ^-moment as long as 
X v exp(— Eq(p)) < 1 or by taking logs: Eq(p) > rjlnX which is true by assumption. 



D. Event-based sampling is simpler than "dancing " 

While the above three-part strategy makes for an easy proof, it invites criticism for being very unrealistic: 

• The "dancing" of the plant seems to be excessive. When the channel output alphabet is large, this gives a 
tremendous performance hit as the channel outputs are communicated back through the plant. This also does 
not work if there is a continuous or other infinite channel output alphabet. 

• The "observer" here essentially has to run a copy of the channel decoder in the controller as well as interpret 
the plant dancing. There is thus duplication of effort in the whole system. 

To overcome this, we just have to notice that essentially all the controller has to be told is when to stop decoding 
this block and when it is time to move on to the next block. We can view successful decoding as an "event" that 
defines when the observer will next "sample" the state process for the purpose of generating channel inputs. The 
above strategy uses the noiseless feedback of channel outputs through the plant to decide when successful decoding 
as occurred — however there is a simpler way to see this. Whenever the controller is successful in decoding the 
channel outputs, it will be able to apply a control that drives the state to the neighborhood of zero! 

This gives rise to modifications to the control strategy that do not rely on the finite channel output alphabet 
assumption: 

• The controller no longer encodes the channel outputs into the control signals by making the plant dance. 

• Instead of waiting for a "go ahead" signal from the observer through the fortification channel, the controller 
applies the control based on its tentative best decision so far. If it gets negative feedback through the fortification 
channel, it backs-out the tentative control 2 it had applied and crosses the corresponding message off the list 
of possibilities. It then moves on to the new top-scoring message and applies the corresponding control until 
it receives positive feedback. 

'Of course, picking a large n also makes the effect of the disturbance and observation noise greater, but it remains bounded no matter 
how large n is and that is all that matters when it comes to the existence of ^-moments. 
2 By adding —\Ut-i to its new control when it wants to countermand its previous control. 
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Fig. 4. The observer maintains two "nets" that it uses to catch the state. The outer "quantization net" represents everywhere the controller 
believes the state could possibly be based on the information it is known to have received so far. At sampling time, it is partitioned into e nR 
uniformly sized boxes and the box corresponding to the true state is used to pick a message to send over the noisy channel. The inner net 
(of size At+T at time T when the last sampling event occurred at time t) represents where the controller could drive the state given that 
it was able to successfully decode the current message block. Both nets grow steadily exponentially at rate ~ A per unit time, and shrink 
suddenly whenever the controller is able to successfully decode and have this success confirmed through a fortified bit. 

• Instead of using the fortification channel to tell the controller when to stop decoding and apply a control, the 
observer uses the fortification channel to simply tell the decoder when it has gotten the right answer. This can 
be observed at the observer by seeing the plant state move into the vicinity 3 of the origin. 

• Otherwise, the observer remains essentially as before when it comes to sampling the controlled state and 
determining what infinitely-long random codeword to start sending incrementally. 

In terms of analysis, there are only two remaining concerns: 

a) False control performance hit: Before a true control action is confirmed, the controller might try some 
different false controls. However, a false control can at worst move the state to twice the outer limits of the current 
uncertainty window. This happens when the true control should be dealing with one extreme endpoint and the false 
applied control corresponds to the other endpoint. However, this factor of 2 just changes the bound on the 77-th 
moment by a factor of 2 V . It will not make a moment infinite if it were finite before. 

b) List decoding: In the original scheme above that was made to leverage the purely channel coding derivation 
in [22], list-decoding was required to disambiguate some remaining uncertainty. This caused this small constant 
(1 + log 2 l)k term to show up along with the original n. But since / is a constant that does not depend on n, the 
log 2 I is not necessary for the result to hold. Everything will work just as well asymptotically even if we used a 
"unary encoding" and used (/ + l)k time-slots for the list-disambiguation. This is essentially what the new observer 
does since it just keeps ruling out the seemingly most likely codewords until the true one is applied. 

Consequently, the event-based sampling approach that uses the fortification to give simple positive and negative 
supervision to the controller will achieve the focusing-bound performance. □ 

IV. Conclusion 

The main result of this paper is that a little bit of noiseless feedback used for supervision can be used to leverage 
the effectiveness of the noisy feedback. The difference in what can be achieved is illustrated in Figure |5] With even 
a tiny bit of noiseless feedback, we can achieve performance corresponding to the upper curves. 

The theorem as stated is asymptotic in nature and it might appear troubling to have to use large n and thereby 
significantly sacrifice performance just in order to stabilize larger moments. However, by applying the "supervisory" 
idea again, no such sacrifice is needed. The control strategy of [20] can be applied and most of the time, it will 
hold the plant very close to the origin. Rarely, the state will wander out of a particular target ball. At that point, 
the observer can use the noiseless channel uses to declare an "emergency" and switch to the control strategy given 
in this paper. Once the state has returned to a moderate sized neighborhood of the origin, the state of emergency 
can be lifted and the control strategy can return to that of [20]. By making the target ball large, emergencies will 
only occur rarely and thus will not impact average performance. However, the emergency mode will ensure that as 
many higher moments of the state will exist as is possible for the particular channel in question. 

3 By vicinity, we simply mean that the observed state lands within a box of size A t +r of the origin when considering time t + T. 
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Fig. 5. The sphere-packing and focusing bounds for the binary symmetric channel with crossover probability 0.02. On the left, reliability 
is plotted relative to rate, while on the right, the the stabilized moment is plotted relative to the unstable gain A. The focusing bound is the 
highest and bounds what is possible with feedback, the sphere-packing bound is below and bounds the reliability for communication without 
feedback. The random coding error exponent matches that bound at high-rates, but falls below at low rates. Since stabilization is equivalent 
to communication with feedback, the upper bound is the target. 
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